Efficient Data Sharing over Large-Scale Distributed Communities

نویسندگان

  • Juan Li
  • Samee Ullah Khan
  • Qingrui Li
  • Nasir Ghani
  • Nasro Min-Allah
  • Pascal Bouvry
  • Weiyi Zhang
چکیده

Data sharing in large-scale Peer Data Management Systems (PDMS) is challenging due to the excessive number of data sites, their autonomous nature, and the heterogeneity of their schema. Existing PDMS query applications have difficulty to simultaneously achieve high recall rate and scalability. In this chapter, we propose an ontology-based sharing framework to improve the quality of data sharing and querying over large-scale distributed communities. In particular, we add a semantic layer to the PDMSs, which alleviates the semantic heterogeneity and assists the system to adjust its topology so that semantically related data sources can be connected. Moreover, we propose a comprehensive query routing and optimization Juan Li Department of Computer Science, North Dakota State University, Fargo, USA, e-mail: [email protected] Samee Ullah Khan Department of Electrical and Computer Engineering, North Dakota State University, Fargo, USA, e-mail: [email protected] Qingrui Li Department of Computer Science, North Dakota State University, Fargo, USA,e-mail: [email protected] Nasir Ghani Department of Electrical and Computer Engineering, University of New Mexico, Albuquerque, USA, e-mail: [email protected] Nasro Min-Allah Department of Computer Science, COMSATS Institute of Information Technology, Islamabad, Pakistan, e-mail: [email protected] Pascal Bouvry Faculty of Sciences, Technology and Communications, University of Luxembourg, Luxembourg, e-mail: [email protected] Weiyi Zhang Department of Computer Science, North Dakota State University, Fargo, USA, e-mail: [email protected]

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BlobSeer: Towards efficient data storage management for large-scale, distributed systems

ions that enable high-performance data sharing at large scale, otherwise the huge computational potential offered by large distributed systems is hindered by poor data sharing scalability. While this problem is well known, existing approaches still face many limitations that need to be overcome.

متن کامل

E2DR: Energy Efficient Data Replication in Data Grid

Abstract— Data grids are an important branch of gird computing which provide mechanisms for the management of large volumes of distributed data. Energy efficiency has recently emerged as a hot topic in large distributed systems. The development of computing systems is traditionally focused on performance improvements driven by the demand of client's applications in scientific and business domai...

متن کامل

A Cluster-enhanced Fault Tolerant Peer-to-peer System

Over the Internet today, computing and communications environments are more complex and chaotic than classical distributed systems, lacking any centralized organization or hierarchical control. Peer-to-Peer network overlays provide a good substrate for creating large-scale data sharing, content distribution and application-level multicast applications. We present a fault-tolerant, cluster-enhan...

متن کامل

An Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity

The data grid technology, which uses the scale of the Internet to solve storage limitation for the huge amount of data, has become one of the hot research topics. Recently, data replication strategies have been widely employed in distributed environment to copy frequently accessed data in suitable sites. The primary purposes are shortening distance of file transmission and achieving files from ...

متن کامل

Godis: Ontology-based Resource Discovery and Integration in Grids

This paper presents GODIS (Grid Ontological Directory and Integration System) a comprehensive architecture for resource sharing and discovery in large-scale grids, where nodes integrate local ontologies to expand semantic knowledge of shared grid resources. A peer-to-peer based DHT overlay is used to facilitate the formation of semantic communities on a large scale. Inside communities, ontologi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011